Abstract:The internal covariant shift of the data and the long distance dependence of the sequence data are not taken into account in the existing deep learning based compound protein interaction prediction methods. To solve the problem, a method for compound-protein interaction prediction based on graph attention network and simple recurrent unit is proposed. The graph attention network-gated recurrent unit is introduced to learn the graph-level representation of compound molecules, the multi-layer-simple recurrent unit is employed to learn feature vector representation of amino acid subsequences, and multilayer-feed-forward network is utilized to predict compound-protein interactions. Experiments show that the evaluation indexes of the proposed method are improved on 2 public datasets, and the effectiveness of the proposed method is verified.
[1] 薛 斌,李益洲,李梦龙.基于化学信息学方法预测药物副作用的研究进展.计算机与应用化学, 2019, 36(5): 487-490. (XUE B, LI Y Z, LI M L. Research Advance in the Drug Side-Effect Prediction Based on Chemoinformatic. Computers and Applied Chemistry, 2019, 36(5): 487-490.) [2] YABUUCHI H, NIIJIMA S, TAKEMATSU H, et al. Analysis of Multiple Compound-Protein Interactions Reveals Novel Bioactive Molecules. Molecular Systems Biology, 2011. DOI:: 10.1038/msb.2011.5. [3] 宗可昕,张 晶,陈云雨,等. PLK1 PBD靶向抗肿瘤抑制剂的筛选及活性研究.中国医药生物技术, 2016, 11(1): 13-20. (ZONG K X, ZHANG J, CHEN Y Y, et al. Screening and Anti-tumor Activity of PLK1 PBD Targeted Inhibitors. China Medical Biotechnology, 2016, 11(1): 13-20.) [4] 孙国章,何 莹,尹照瑞,等.灯盏花素与牛血清白蛋白相互作用的光谱学研究.山西中医药大学学报, 2020, 21(3): 192-195. (SUN G Z, HE Y, YIN Z R, et al.Spectroscopic Study of the Interaction between Breviscapine and Bovine Serum Albumin. Journal of Shanxi University of Chinese Medicine, 2020, 21(3): 192-195.) [5] VILAR S, HRIPCSAK G. Leveraging 3D Chemical Similarity, Target and Phenotypic Data in the Identification of Drug-Protein and Drug-Adverse Effect Associations. Journal of Cheminformatics, 2016, 8. DOI: 10.1186/s13321-016-0147-1. [6] MOLINA D M, JAFARI R, IGNATUSHENKO M, et al. Monitoring Drug Target Engagement in Cells and Tissues Using the Cellular Thermal Shift Assay. Science, 2013, 341(6141): 84-87. [7] 张建锋.在线微透析电化学发光传感器研究药物蛋白相互作用.硕士学位论文.西安:陕西师范大学, 2012. (ZHANG J F. Coupling On-line Microdialysis Sampling with the Electrogenerated Chemiluminescence Sensor for Studying Drug Protein Interactions. Master Dissertation. Xi'an, China: Shaanxi Normal University, 2012.) [8] 李 玲,李佳蔚,张月梅.基于分子对接预测靶点ACE2和IL-6R研究归芪白术方治疗新型冠状病毒肺炎的物质基础及其作用机制.甘肃中医药大学学报, 2020, 37(2): 1-9. (LI L, LI J W, ZHANG Y M. Material Basis and Action Mechanism of Guiqi Baizhu Fang(归芪白术方) in the Treatment of COVID-19 Based on Molecular Docking Prediction for Target ACE2 and IL-6R. Journal of Gansu University of Chinese Medicine, 2020, 37(2): 1-9.) [9] MA D L, CHAN D S H, LEUNG C H. Drug Repositioning by Structure-Based Virtual Screening. Chemical Society Reviews, 2013, 42(5): 2130-2141. [10] 贾聪敏.基于分子振动特征的药物靶点识别及活性预测模型研究.硕士学位论文.北京: 北京中医药大学, 2019. (JIA C M. Research on Drug Target Recognition and Activity Prediction Model Based on Molecular Vibration Characteristics. Master Dissertation. Beijing, China: Beijing University of Chinese Medicine, 2019.) [11] CHOW E, KLEPEIS L. 9.6 New Technologies for Molecular Dynamics Simulations. Comprehensive Biophysics, 2012, 9: 86-104. [12] JACOB L, VERT J P. Protein-Ligand Interaction Prediction: An Improved Chemogenomics Approach. Bioinformatics, 2008, 24(19): 2149-2156. [13] BLEAKLEY K, YAMANISHI Y. Supervised Prediction of Drug-Target Interactions Using Bipartite Local Models. Bioinformatics, 2009, 25(18): 2397-2403. [14] COELHO E D, ARRAIS J P, OLIVEIRA J L. Ensemble-Based Methodology for the Prediction of Drug-Target Interactions // Proc of the 29th IEEE International Symposium on Computer-Based Medical Systems. Washington, USA: IEEE, 2016: 36-41. [15] TIAN K, SHAO M Y, WANG Y, et al. Boosting Compound-Protein Interaction Prediction by Deep Learning. Methods, 2016, 110: 64-72. [16] NGUYEN T, LE H, QUINN T P, et al. GraphDTA: Predicting Drug-Target Binding Affinity with Graph Neural Networks. Bioinformatics, 2020. DOI: 10.1093/bioinformatics/btaa921. [17] TSUBAKI M, TOMII K, SESE J. Compound-Protein Interaction Prediction with End-to-End Learning of Neural Networks for Graphs and Sequences. Bioinformatics, 2019, 35(2): 309-318. [18] KIPF T N, WELLING M. Semi-Supervised Classification with Graph Convolutional Networks[C/OL]. [2020-11-09]. https://arxiv.org/pdf/1609.02907.pdf. [19] LECUN Y, BOTTOU L, BENGIO Y, et al. Gradient-Based Lear-ning Applied to Document Recognition. Proceedings of the IEEE, 1998, 86(11): 2278-2324. [20] VELIČKOVIĆ P, CUCURULL G, CASANOVA A, et al. Graph Attention Networks[C/OL]. [2020-11-09]. https://arxiv.org/pdf/1710.10903.pdf. [21] CHO K, VANMERRIENBOER B, GULCEHRE C, et al. Learning Phrase Representations Using RNN Encoder-Decoder for Statistical Machine Translation // Proc of the Conference on Empirical Me-thods in Natural Language Processing. New York, USA: ACM, 2014: 1724-1734. [22] LEI T, ZHANG Y. Training RNNs as Fast as CNNs[C/OL]. [2020-11-09]. https://arxiv.org/pdf/1709.02755v1.pdf. [23] ULYANOV D, VEDALDI A, LEMPITSKEY V. Instance Normalization: The Missing Ingredient for Fast Stylization[C/OL]. [2020-11-09]. https://arxiv.org/pdf/1607.08022v3.pdf. [24] BAHDANAU D, CHO K, BENGIO Y. Neural Machine Translation by Jointly Learning to Align and Translate[C/OL]. [2020-11-09]. https://arxiv.org/pdf/1409.0473v5.pdf. [25] LIU H, SUN J J, GUAN J H, et al. Improving Compound-Protein Interaction Prediction by Building up Highly Credible Negative Samples. Bioinformatics, 2015, 31(12): 221-229.